Candidate gene prioritization with Endeavour

Authors

Léon-Charles Tranchevent

Amin Ardeshirdavani

Sarah ElShal

Daniel Alcaide

Jan Aerts

Didier Auboeuf

Yves Moreau

Abstract

Genomic studies and high-throughput experiments often produce large lists of candidate genes, of which only a small fraction is truly relevant to the disease, phenotype or biological process of interest. Gene prioritization tackles this problem by profiling candidate genes across multiple genomic data sources and integrating this heterogeneous information into a global ranking. We describe an extended version of our gene prioritization method, Endeavour, now available for six species and integrating 75 data sources. The performance (Area Under the Curve) of Endeavour on cross-validation benchmarks using 'gold standard' gene sets varies from 88% (for human phenotypes) to 95% (for worm gene function). We have also validated our approach using a time-stamped benchmark derived from the Human Phenotype Ontology, which provides a setting close to prospective validation. With this benchmark, using 3854 novel gene-phenotype associations, we observe a performance of 82%. Altogether, our results indicate that this extended version of Endeavour efficiently prioritizes candidate genes. The Endeavour web server is freely available at https://endeavour.esat.kuleuven.be/.
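
The idea of ranking candidates per data source and then merging those rankings into one global ranking, scored by Area Under the Curve, can be illustrated with a minimal sketch. This is not Endeavour's actual fusion procedure, which the abstract does not specify: the averaging of rank ratios is a simple stand-in chosen for illustration. The gene names, source names, scores and the functions `rank_ratios`, `fuse_rankings` and `auc` are all hypothetical.

```python
from statistics import mean

def rank_ratios(scores):
    """Map each gene to rank / n for one data source (best score gets 1/n)."""
    ordered = sorted(scores, key=scores.get, reverse=True)
    n = len(ordered)
    return {gene: (i + 1) / n for i, gene in enumerate(ordered)}

def fuse_rankings(sources):
    """Average per-source rank ratios (an assumed, simplified fusion rule).

    A gene missing from a source is ignored for that source only.
    Returns the genes ordered from most to least promising.
    """
    per_source = [rank_ratios(s) for s in sources.values()]
    genes = set().union(*(r.keys() for r in per_source))
    fused = {g: mean(r[g] for r in per_source if g in r) for g in genes}
    return sorted(fused, key=fused.get)

def auc(ranking, positives):
    """Fraction of (positive, negative) pairs with the positive ranked higher."""
    pos_idx = [i for i, g in enumerate(ranking) if g in positives]
    neg_idx = [i for i, g in enumerate(ranking) if g not in positives]
    wins = sum(p < n for p in pos_idx for n in neg_idx)
    return wins / (len(pos_idx) * len(neg_idx))

# Hypothetical similarity scores of four candidate genes in three data sources.
sources = {
    "expression": {"G1": 0.9, "G2": 0.2, "G3": 0.5, "G4": 0.1},
    "interactions": {"G1": 0.7, "G2": 0.6, "G3": 0.8},
    "annotation": {"G1": 0.8, "G3": 0.4, "G4": 0.3},
}
ranking = fuse_rankings(sources)
print(ranking)
print(auc(ranking, positives={"G1"}))
```

In a cross-validation setting such as the one described above, a known gene would be held out of the training set, mixed with the candidates, and the position of that gene in the fused ranking would contribute to the AUC.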

Related articles

Endeavour update: a web resource for gene prioritization in multiple species

Endeavour (http://www.esat.kuleuven.be/endeavourweb; this web site is free and open to all users and there is no login requirement) is a web resource for the prioritization of candidate genes. Using a training set of genes known to be involved in a biological process of interest, our approach consists of (i) inferring several models (based on various genomic data sources), (ii) applying each mo...

Kernel-based data fusion for gene prioritization

MOTIVATION: Hunting disease genes is a problem of primary importance in biomedical research. Biologists usually approach this problem in two steps: first a set of candidate genes is identified using traditional positional cloning or high-throughput genomics techniques; second, these genes are further investigated and validated in the wet lab, one by one. To speed up discovery and limit the numbe...

Integrating Computational Biology and Forward Genetics in Drosophila

Genetic screens are powerful methods for the discovery of gene-phenotype associations. However, a systems biology approach to genetics must leverage the massive amount of "omics" data to enhance the power and speed of functional gene discovery in vivo. Thus far, few computational methods for gene function prediction have been rigorously tested for their performance on a genome-wide scale in viv...

Integration of Multiple Data Sources to Prioritize Candidate Genes Using Discounted Rating System

Background: Identifying disease genes from a list of candidate genes is an important task in bioinformatics. The main strategy is to prioritize candidate genes based on their similarity to known disease genes. Most existing gene prioritization methods access only one genomic data source, which is noisy and incomplete. Thus, there is a need for the integration of multiple data sources containi...

A Novel Prioritization Method in Identifying Recurrent Venous Thromboembolism-Related Genes

Identifying the genes involved in venous thromboembolism (VTE) recurrence is important not only for understanding the pathogenesis but also for discovering therapeutic targets. We proposed a novel prioritization method called Function-Interaction-Pearson (FIP) by creating gene-disease similarity scores to prioritize candidate genes underlying VTE. The scores were calculated by integrating an...

Volume: 44

Publication date: 2016
